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AMENDMENTS TO THE CLAIMS 




Listing of Claims: 

1. (currently amended) A method of handling data for 
determining functional similarity between portions of gene 
expression profiles in a computer comprising the steps of: 
processing a number of gene expression profiles with a similar 
sequences algorithm that is a time and intensity invariant 
correlation function to obtain a data set of gene expression 
pro file pairs and a match fraction for each gene expression 
profile pair; 

listing gene expression profile pairs in clusters by their 
match fractions; 

removing a first gene express.i on profile from a cluster when 
another cluster has another gene expression profile wj th a 
higher match fraction with the first gene expression prof ile, 
unless the another gene expression profile requires a larger 
number of subsequences to achieve similarity with the first 
gene expression profil e; 

repeating the removing step until all gene expression profiles 
are listed in only one clustery 

providing output of t he listing of clusters of gone 
expression p rofiles * 



CHA9 2003 0003 USl -3- 
Serial No: 10/829,448 



PAGE 5120 " RCVD AT 911 112006 10:31:58 AM [Eastern Daylight Time] 1 SVR:USPTO-EFXRF-2/17 1 DNIS:2738300 ■ CSID:uscltlb4 1 DURATION (mm-ss):04-18 



Filed: July 29, 2003 



2006-09-11 10:22 



use 1 1 1 b4 



704-594-8307 » USPTO 



P 6/20 



2. (currently amended) A method of handling da ta lor 
determining functional similarity between portions of gene 
expression profiles comprising the steps of: 

processing a number of gene expression profiles with a similar 
sequences algorithm that is a time and intensity invariant 
correlation function to obtain a data set of gene expression 
pairs and a match fraction for each pair; 

listing gene expression pairs in clusters by their match 
fractions; 

removing a first gene from a first cluster when Lhe first gene 
is also in a second cluster which has another gene with a 
higher match fraction with the first gene than any of the genes 
in the first, cluster have with the first gene, but; 
retaining the first gene in the first cluster and removing the 
first gene from the second cluster when the difference between 
the highest match fraction of the first gene with a gene in the 
first cluster and the highest match fraction of the first gene 
with a gene in the second cluster is less than a minimum 
difference threshold and the number of subsequences represented 
in the similar qene pair having the highest match fraction in 
the first cluster is higher than the number of subsequences 
represented in the similar gene pair having the highest match 
fraction in the second cluster; 

repeating the removing step until all genes are listed in only 
one clustery 

provid in g output of the lis ting of clusters o f gene_ 
expression profiles. 
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3. (currently amended) A method of determining f unotiof fr a - l 
similarity between portions of gene expression profiles 
comprising the steps of: 

processing the data embodying a number of gene expression 
profiles with a similar sequences algorithm in a comput er that 
is a time and intensity invariant correlation function to 
obtain a data set of gene expression pairs and a match fraction 
for each pair; 

choosing a threshold match fraction; 

listing gene expression pairs in clusters by their match 
fractions above the threshold; 

adding each gene not already in a cluster to a cluster having 
another gene having a highest match fraction with the each gene 
without regard of the threshold; 

removing a first gene from a cluster when the first gene .is 
also in another cluster which has another gene with a higher 
match fraction with the first gene than any of the genes in the 
cluster have with the first gene; 

repeating the removing step until all genes are listed in only 
one clustery 

providing o utput of the l ist ing of cluster s o f gene 
expres sion profile s . 
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4. (currently amended) A method of handling dat a for 
determining f unctional similarity between portions of gene 
expression profiles comprising the steps of: 

processing a number of gene expression profiles with a similar 
sequences algorithm that is a time and intensity invariant 
correlation function with a computer to obtain a data set of 
gene expression pairs and a match fraction for each pair; 
choosing a threshold match fraction; 

listing gene expression pairs in clusters by their match 
fractions above the threshold; 

adding each gene not already in a cluster to a cluster having 
another gene having a highest match fraction disregarding the 
threshold with the each gene; 

removing a first gene from a first cluster when the first gene 
js also in a second cluster which has another gene with a 
higher match fraction with the first gene than any of the genes 
in the first cluster have with the first gene, but; 
retaining the first gene in the first cluster and removing the 
first gene from the second cluster when the difference between 
the highest match fraction of the first gene with a gene in the 
first cluster and the highest match fraction of the first gene 
with a gene in the second cluster is less than a minimum 
difference threshold and the number of subsequences represented 
in the similar gene pair having the highest match fraction in 
the first cluster is higher than the number of subsequences 
represented in the similar gene pair having the highest match 
fraction in the second clusters- 
repeating the removing and retaining steps until all genes are 
listed in only one clustery 

providing output of th e l isting of clust ers of gene 
expression profi l c s - 
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5. (currently amended) A method of handling data in a computer 
for determining functional similarity between genes comprising 
the steps of: 

listing genes to be compared in a data set by their gene 
expression profiles; 

processing the listed gene expression profiles with a similar 
sequences algorithm that is a time and intensity invariant 
correlation function to obtain a data set of gene expression 
pairs and a match fraction for each pair; 
choosing a threshold match fraction; 

creating a set G in which to list indices of genes accounted 
for; 

assigning genes i and j to a cluster a if they have a match 
fraction qreatcr than the threshold; 

assigning gene k to the cluster a if it has a match fraction 
greater than the threshold with either gene i or gene j ; 
assigning genes k and 1 to a cluster b if they have a match 
fraction greater than the threshold and if both gene k and gene 
1 do not have match fractions above the threshold with either 
gene i or gene j; 

repeating the assigning steps until all genes to be compared 
have been considered; 

removing a first gene from a cluster when another cluster has 
another gene with a higher match fraction with the first gone; 
repeating the removing step until all genes are listed in only 
one clustery 

providin g output of th e listing of clu s ters of gene 
exp ression pro fi les - 
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6. (currently amended) ft method of handling data in a computer 
for determining funct ionoJ similarity between genes comprising 
the steps of: 

listing genes to be compared in a data set by their gene 
expression profiles; 

processing the listed gene expression profiles with a similar 
sequences algorithm that is a time and intensity invariant 
correlation function to obtain a data set of gene expression 
pairs and a match fraction fox each pair; 
choosing a threshold match fraction; 

creating a set G in which to list indices of genes accounted 
for; 

assigning genes i and j to cluster 1 if they have a match 
fraction greater than the threshold; 

assigning gene k to cluster 1 if it has a match fraction 
greater than the threshold with either gene i or gene i ; 
assigning genes k and 1 to cluster 2 if they have a match 
fraction greater than the threshold and if both gene k and gene 
1 do not have match fractions above the threshold with either 
gene i or gene j ; 

removing a first gene from a cluster when another cluster has 
another gene with a higher match fraction with the first gene, 
unless the another gene requires a larger number of 
subsequences to achieve similarity with the first gene; 
repeating the removing step until all genes are listed in only 
one clustery 

provid in g output of t he listing of clu st ers of gene 
expression p ro files . 
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7 . (canceled) 

8 . (cancel ed) 

9. (canceled) 

1 0. (currently amended) A program product having computer 
readable code stored on a recordable media for determining 
i - unction , jl similarity between portions of gene expression 
prof i les comprising : 

programmed means for processing a number of gene expression 
profiles with a similar sequences algorithm that is a time and 
intensity invariant correlation function to obtain a data set 
of gene expression pairs and a match fraction for each pair; 
programmed means for listing gene expression pairs in clusters 
by their match fractions; 

programmed means for removing a first gene from a cluster when 
the first gene is also in another cluster which has another 
gene with a higher match fraction with the first gene than any 
of the genes in the cluster have with the first gene; 
programmed means for repeating the removing step unti 1 all 
genes are listed in only one cluster. 
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11 . (current 1 y amended) A proqram product having computer 
readable code stored on a recordable media for determining 
f unnt.i nnn] similarity between portions of gene expression 
profiles using output from a similar sequences algorithm that 
is a time and intensity invariant correlation lunction 
comprising: 

programmed means for providing a gene expression profile data 
sol as input to programmed means embodying a similar sequences 
algorithm that is a time and intensity invariant correlation 
function to obtain a data set of gene expression pairs and a 
match fraction for each pair as output from the programmed 
means embodying a similar sequences algorithm; 

programmed means for listing the gene expression pairs in 
clusters by their match fractions; 

programmed means for removing a first qene from a cluster when 
the first gene Is also in another cluster which has another 
gene with a higher match fraction with the first gene than any 
of the genes in the cluster have with the first gene; 
programmed means for repeating the removing step until all 
genes arc listed in only one cluster. 
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12 , (currently amended) A program product having computer 
readable code slored on a recordable media for determining 

similarity between portions of gene expression 
profiles comprising the steps ol: 

programmed means for processing a number oi gene expression 
profiles with a similar sequences algorithm that is a Lime and 
intensity invariant correlation function to obtain a data set 
of gene expression pairs and a match fraction for each pair; 
programmed means for listing gene expression pairs in clusters 
by thoir match fractions; 

programmed means for removing a first gene from a first 
cluster when the first gene is also in a second cluster which 
has another gene with a higher match fraction with the first 
gene than any of the genes in the first cluster have with the 
first gene, but/ 

programmed means for retaining the first gene in the first 
cluster and removing the first gene from the second cluster 
when the difference between the highest match fraction of the 
first gene with a gene in the first cluster and the highest 
match fraction of the first gene with a qene in the second 
cluster is less than a minimum difference threshold and the 
number of subsequences represented in the similar gene pair 
having the highest match fraction in the first cluster is 
higher than the number oi subsequences represented in the 
similar gene pair having the highest match fraction in the 
second cluster; 

programmed means for repeating the removing step until all 
genes are listed in only one cluster. 
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13. (currently amended) A program product having computer 
readable code stored on a recordable media for determining 
funct.i onnl similarity between portions of gene expression 
profiles comprising the steps of: 

programmed means for processing a number of gene expression 
profiles with a similar sequences algorithm that is a time and 
intensity invariant correlation function to obtain a data set 
of gene expression pairs and a match fraction for each pair; 
programmed means for choosing a threshold match fraction ; 
programmed means for listing gene expression pairs in clusters 
by their match fractions above the threshold; 

programmed means for adding each gene not already in a cluster 
to a cluster having another gene having a highest match 
fraction with the each gene without regard of the threshold; 
programmed means for removing a first gene from a cluster when 
the first gene is also in another cluster which has another 
gene with a higher match fraction with the first gene than any 
of the genes in Lhe cluster have with the first gene; 
programmed means for repeating the removing step until all 
genes are listed in only one cluster. 
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14. (currently amended) A program product, havinq computer 
readable code stored on a recordable media for determining 
functional similarity between portions of gene expression 
profiles comprising the stops of: 

programmed means for processing a number of gene expression 
profiles with a similar sequences algorithm that is a time and 
intensity invariant correlation function to obtain a data set 
of gene expression pairs and a match fraction for each pair; 
programmed means for choosing a threshold match fraction; 
programmed means for listing gene expression pairs in clusters 
by their match fractions above the threshold; 

programmed means for adding each gene not. already in a cluster 
to a cluster having another gene having a highest match 
fraction disregarding the threshold with the each gene; 
programmed means for removing a first gene from a first 
cluster when the first gene is also in a second cluster which 
has another gene with a higher match fraction with the first 
gene than any of the genes in the first cluster have with the 
first gene, but; 

programmed means for retaining the first gene in the first 
cluster and removing the first gene from the second cluster 
when the difference between the highest match fraction of the 
first gene with a gene in the first cluster and the highest- 
match fraction of the first gene with a gene in the second 
cluster is less than a minimum difference threshold and the 
number of subsequences represented in the similar gene pair 
having the highest match fraction in the first cluster is 
higher than the number of subsequences represented in the 
similar gene pair having the highest match fraction in the 
second cluster; 

programmed means for repeating the removing and retaining 
steps until all genes are listed in only one cluster. 
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1.5. (currently amended) IK program product having computer 
readable code stored on a recordable media lor determining 
f uncti -ofttfrjr similarity between genes comprising the steps oi : 
programmed means for listing genes to be compared by their 
gene expression profiles; 

programmed means for processing the listed gene expression 
profiles with a similar sequences algorithm that is a Lime and 
intensity invariant correlation function to obtain a data set 
of gene expression pairs and a match fraction for each pair; 
programmed means tor choosing a threshold match fraction; 
programmed means for creating a null set G(0) to hold genes 
accounted for; 

programmed means for assigning genes i and j to cluster 1 if 
they have a match fraction greater than the threshold; 
programmed means for assigning gene k to cluster 1 if it has a 
match fraction greater than the threshold with either gene i or 
gene j; 

programmed means for assigning genes k and 1 to cluster 2 ii 
they have a match fraction greater than the threshold and if 
both gene k and gene 1 do not have match fractions above the 
threshold with either gene i or gene j; 

programmed means for removing a first gene from a cluster when 
another cluster has another gene with a higher match fraction 
with the first gene; 

programmed means for repeating the removing step until all 
genes are listed in only one cluster. 
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16. (currently amended) A program product having computer 
readable code stored on a recordable media for determining 
functional similarity between genes comprising the steps of: 
programmed means for listing genes to be compared by their 
gene expression profiles; 

programmed means for processing the listed gene expression 
profiles with a similar sequences algorithm that is a time and 
intensity invariant correlation function to obtain a data set 
of gene expression pairs and a match fraction for each pair; 
programmed means for choosing a threshold match fraction; 
programmed means for creating a null set G(0) to hold genes 
accounted for; 

programmed means for assigning genes i and j to cluster 1 if 
they have a match fraction greater than the threshold; 
programmod means for assigning gene k to cluster 1 if it has a 
match fraction greater than the threshold with either gene i or 
gene, j ; 

programmed means for assigning genes 1c and 1 to cluster 7 if 
they have a match fraction greater than the threshold and if 
both gene k and gene 1 do not have match fractions above the 
threshold w.i th either gene i or gene j; 

programmed means for removing a first gene from a cluster when 
another cluster has another gene with a higher match fraction 
with the first gene, unless the another gene requires a larger 
number of subsequences to achieve similarity with the first 
gene; 

programmed means for repeating the removing step until all 
genes are listed in only one cluster. 
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17 . (canceled) 

18 . (canceled) 

19. (canceled) 

20. (currently amended) In a method ol determining functional 
similarity between portions of gene expression profiles which 
includes processing a number of gene expression profiles using 
a compute r with a similar sequences algorithm that is a time 
and intensity invariant correlation function to obtain a data 
set of gene expression pairs and a match fraction for each 
pair f the improvement comprising the steps oft 

Listing gene expression pairs in clusters by their match 
fractions; 

removing a first gene from a cluster when another cluster has 
another gene with a higher match fraction with the first gene, 
unless the another gene requires a larger number of 
subsequences to achieve similarity with the first gene; 
repeating the removing step until all genes are listed in only 
one cluster; 

p roviding output o f the listi ng of clust e rs of gen e 
e xpressio n profile s . 
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